Goto

Collaborating Authors

 efficienttransformer forlanguageandvision


Long-ShortTransformer: EfficientTransformers forLanguageandVision(Appendix) ADetailsofNormComparisons

Neural Information Processing Systems

The first design helps the model focus more on the global context of the image as each patch could attend to the whole image areas. It reduces the local texture bias ofCNN.